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Abstract 

We have designed an asynchronous chip for contactless 
smart cards. Asynchronous circuits have two power prop- 
erties that make them very suitable for contactless devices: 
low average power and small current peaks. The fact that 
asynchronous circuits operate over a wide range of the sup- 
ply voltage, while automatically adapting their speed, has 
been used to obtain a circuit that is very resilient to voltage 
drops while giving maximum performance for the power be- 
ing received 

The asynchronous circuit has been built, tested and 
evaluated and the results were so convincing that, based on 
the circuits presented a product is being designed. 

Keywords: low-power asynchronous circuits, smart cards, 
contactless devices, DES cryptography. 



1. Introduction 

Since their introduction in the eighties, smart cards have 
been used in a continuously growing number of appli- 
cations, such as banking, telephony (telephone and SIM 
cards), access control (Pay-TV), health-care, tickets for 
public transport, electronic signatures and identification. 
Currently, most cards have contacts and, for that reason, 
need to be inserted into a reader. For applications in which 
the fast handling of transactions is important, so-called con- 
tactless smart cards have been introduced requiring only 
close proximity to a reader (typically several centimeters). 
The chip on such a card must he extremely power efficient, 
since it is powered by electromagnetic radiation. 

Asynchronous CMOS circuits have the potential for very 
low power consumption, since they only dissipate when and 
where active. However, asynchronous circuits arc difficult 
to design at the level of gates and registers. Therefore the 
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high-level design language Tangram was defined [10] and a 
so-called silicon compiler has been implemented that trans- 
lates Tangram programs into asynchronous circuits. Tan- 
gram is a conventional programming language, like C or 
Pascal, extended to include constructs for expressing con- 
currency and communication in a way similar to the lan- 
guage CSP [2J. Similar approaches have been proposed in 
[1,3]. 

The Tangram compiler generates a special class of asyn- 
chronous circuits called handshake circuits [8, 6]. Hand- 
shake circuits are constructed from a set of about 30 ba- 
sic components that use 4-phase single-rail handshake sig- 
nalling for communication. 

Several chips have been designed in Tangram [9, 11] and 
if we compare these chips to existing clocked implementa- 
tions, then the asynchronous versions are generally about 
20% larger in area and consume about 25% of the power. 

In order to find out what advantages asynchronous cir- 
cuit*; have to offer in contactless smart cards, we have de- 
signed the digital circuitry of a smart card chip as well as 
the analog power regulator. In this paper, we indicate which 
properties of asynchronous circuits have been exploited and 
we present the results. The paper is organised as follows. 
Section 2 provides some background to contactless smart 
cards and identifies the power characteristics in which con- 
tactless devices differ from battery-powered ones. Section 
3 elaborates on the differences in power behaviour between 
synchronous and asynchronous circuits and indicates how 
these differences can be exploited in contactless devices. 
The next two sections present the design, section 4 dis- 
cussing the digital circuit and section 5 the power supply 
unit. In section 6 we conclude with the results and a sum- 
mary of the merits of this asynchronous design. 

2. Contactless smart cards 

Fig. I shows the functional parts of a contactless smart 
card consisting of a VLSI circuit (in the dotted box) and 
an external coil. The tuned circuit formed by the coil and 
capacitor Co is used for three purposes: 
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Figure 1. Contactless smart card 



• receiving power; 

• receiving the clock frequency (equal to the carrier fre- 
quency); and 

• supporting the communication. 

The circuit proper consists of a Power Supply U n j, and a 
digital circuit with a buffer capacitor (C.) for storing en- 
ergy. 

Contactless smart cards have a number of advantages 
when compared to contacted ones: they arc convenient and 
fast to use. insensitive to dirt and grease and. since their 
waders ha VC no sIots , hcy m |css ^ to vandMsm 

Since contactless smart card chips receive only a few 
™ WCT ' Pf wcrcffici ^y is very important. Although 
low power is also important in battery-powered devices 
there are two crucial differences between both kinds of de-' 
vices. 

1. To maximize the battery life-lime in battery-powered 
devices, one should minimize the average power con- 
sumption. In contactless devices, however, one should 
•n addition minimize the peak power, since the peaks 
must be kept below a certain level, depending on the 
incoming power as well as the buffer capacitor. 

2. The supply voltage is nearly constant in battery pow- 
ered devices, whereas in contacdess ones it may vary 

ver time during a transaction due to fluctuations in 
both the incoming and the consumed power. 

Several standards for contacdess smart cards, currently 
exist. The main standard, however, is Mifare [4|. which has 

card (u can be used up to 10 cm) supporting two way com- 
munication. The carrier frequency is 13.56 Mhz and the 
communication rate is 106 Kbit/sec. Performance is impor- 
tant, since l he transaction time must be less than 200 msec 
One of the first companies to deploy Mifarc technology en 
T 7 s ,hc ^ Bus Association, which currently has 6 
million bus cards in use, generating 80 million transactions 
per month. 



In the bullets below, we give some facts about conven- 
tional synchronous chips for contactless smart cards which 
as we will see later, offer opportunities for improvemen by 
using asynchronous circuits. X 

♦ A synchronous circuit runs at a fixed speed dictated by 
the clock, despite the fact that both the incoming and 
the effectively consumed power vary over time Syn 
chronous circuits must, therefore, be designed so as 
jo allow the most power-hungry operations to be per- 
formed when minimum power is coming in. Conse- 
quently, if too much power is being received that su- 
perfluous power is thrown away. If, on the other hand 
too little power is being received, the transaction must 
be canceled. If too little power comes in. the supply 
voltage drops making the circuit slower and. as soon 
as the circuit has become too slow to meet the time 
requirements set by the clock, the transaction must be 
canceled. For this reason contactless smart card chips 
contain subcircuits that detect when the voltage drops 
below a certain threshold and then abort the transac- 
tion. 

• Currendy, the performance of the microcontroller in a 
contactless smart card chip is usually not limited by 
the speed of the circuit, but by the RF-power being 
received. 6 

• A synchronous circuit requires a buffer capacitor of 
several nanoFarads and the area needed for such a ca- 
pacitor is of the same order of magniiude as the area 
needed for the microcontroller. 

• The communication from the smart card to the reader 
is based on modulating the load, which implies that 
normal functional load fluctuations may interfere with 
the communication. ' 

3. Differences between synchronous and asyn- 
chronous circuits 

When the asynchronous circuits generated by the Tan- 
gram compiler are compared to synchronous ones, three 
differences stand out. leading to four atiractivc properties 
of asynchronous circuits. 

1. The subcircuits in a synchronous circuit arc clock- 
dnven, whereas they are demand-driven in an asyn- 
chronous one. This means that the subcircuits in an 
asynchronous circuit arc only active when and where 
needed. Asynchronous circuits will therefore gener- 
ally dissipate less power than synchronous ones. 

2. The operations in a synchronous circuit arc synchro- 
nized by a central clock, whereas they are synchro- 
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nized by distributed handshakes in an asynchronous 
circuit. Therefore 

a) a synchronous circuit shows large current peaks 
at the clock edges, whereas the power consump- 
tion of an asynchronous circuit is more uniformly 
distributed over time: 

b) the strict periodicity of the clock in a syn- 
chronous circuit leads to higher harmonics in 
the frequency spectrum, which are absent in the 
spectrum of an asynchronous design. 

3. Synchronous circuits use an external time reference, 
whereas asynchronous circuits are self-timed. This 
means thai asynchronous circuits operate over a wide 
range of the supply voltage (for instance, from 1 up to 
3.3 V) while automatically adapting their speed. 

Property 2.b was the main reason for Philips Semicon- 
ductors to design a family of asynchronous pager chips 1 . 
However, it is the other properties that can be used advanta- 
geously in con tactless smart card chips. Since it is the peak 
power that matters for contactless applications, both advan- 
tage 1 and 2.a are relevant, and property 3 can be used to 
cope with the fluctuating supply voltage, 

4* The digital circuit 

We have built the digital circuit shown in fig. 2 that con- 
sists of: 

• an 80C51 micro-controller; 

• three kinds of low-power memory, of which the sizes 
and accesstimes are given in Table I (64 bytes can 
be written simultaneously in one write access to the 
EEPROM); 

• two encryption co-processors: 

* an RSA converter £7] for public key conversions and 
- a triple DES converter [51 for private key conversions; 

• a UART for the external communication. 

The EEPROM contains program parts as well as data such 
as encryption keys and e-money. Both the ROM and 
the RAM are equipped with matching delay lines and for 
the EEPROM we designed a similar function based on a 
counter. These delay lines have been used to provide all 
three memories with a handshake interface, which made it 
extremely easy to deal with the differences in access time as 
well as variations in both temperature and supply voltage. 

The circuit is meant to he used in a so-called dual inter- 
face card, which is a card with both a contacted and a con- 
tactless interface. Apart from the RSA converter, which will 

1 h*tp://wwv<„WTicoodiicton. philips com/pi p/PCA5007H 
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Figure 2. Global design digital circuit 



not be used in contactless operation, all circuits are asyn- 
chronous. In contactless operation, the average supply volt- 
age will be about 2 V. The simulations, however, are done 
at 3.3 V, which is the voltage at which the library has been 
characterized. 



Memory 
type 


Size 
[kbyte] 


Access time [ns] 


read 


write 


RAM 


2 


10 


10 


ROM 


38 


30 


30 


EEPROM 


32 


180 


4,000 



Table 1. Memory sizes and access times 



4.1. The80C5I micro-controller 

The 80C5 1 micro-controller is a modified version of the 
one described in (II). The four most important modifica- 
tions arc described below. 

To deal with the slow memories a prefetch unit has been 
included in the 80C5 1 architecture. At 3.3 V the average in- 
struction execution time in free-running mode is about 100 
ns provided it takes no time to fetch the code from memory. 
If, however, code is fetched from the EEPROM and the mi- 
crocontroller would have to wait during such read accesses, 
the performance would be drastically reduced, since most 
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Figure 3. Current peaks of 80C51 micro-controller 



msu-ucuons are one or two byte long, taking 180 or 360 
ns to fetch. To avoid this performance degradation a form 
of instruct™ prefetching has been introduced in which a 
process running concurrently to the 80C51 core is fetching 
code bytes as long as a two byte fifo is not full. The prefetch 
unit gives an increase in performance of about 30%. A sim- 

s P ubs2^r° n ° f * C PrCfetCh " dCSCribed in ^ 

tfj^^ introduced «»"> "rite completion, which means 
that the micro-controller continues execution as soon as it 
has .ssued a write access. It has been introduced to prevent 
he imcrocontroller f rom waUing dunng ^ 4 msec ? 

to do a write access to the EEPROM (for instance to change 
Oie e-money) but also to speed up the write accesses to the 

th^EEPROM rh' fea,Ure WhC " doin « a wri,e ac ~« «° 
utc fcfcPROM. the corresponding code must be in the ROM 

The controller has been provided with an inunediate halt 

TT^'r P rov,s,on is "pessary to deal with the fact 
•hac the mformauon. which the reader sends to the card, is 

S^S.S SUp P reSsing Carricr d «ring periods of 3 ,«ec. 
Since the card does not receive any power during Uiese peri- 
od* the controller has to be halted immediately (only some 
basic functions continue to operate) ]n the synchronous 

would stop dunng these periods. 

in 1 Wr0duCcd a so - ca,k d synchn,nous mode, 

n which the micro-controllcris. at instruction level, fully 
Ze 8 ~ mpa,ib ' e Wi,h " s -ychronous counterpart. In this 
"■ode. the asynchronous micro-controller waits after each 
instruction until the number of Cock ticks required by a 
synchronous version liave elapsed. This mode is neces- 
sary when time dependent functions arc designed in soft- 
ware. Since this mode is under software control, the micro- 



controller can easily switch modes depending on the func- 
tion it is execuung. This feature was also of great help 
■o demonstrate the guaranteed performance, which is the 

wunlnZ W , hiCh Cach imion *™nates 

within the given number of clock ticks. For most programs 
toe free-running performance is about twice as high as the 
guaranteed performance. 6 

The micro-controller nicely demonstrates the three prop- 
erties of asynchronous circuits that we want to exploit in 
the design of the smart card chip. We could compare the 
asynchronous version with a synchronous one. where the 
synchronous one gives a comparable performance. 

• Jn^^ VeraE K T C ° nSUmp(ion of lh * asynchronous 
80C51 is about three limes lower than the power con- 
sumption of its synchronous counterpart when deliver- 
ing the same performance at the same supply voltage. 

• Fig. 3 shows the current peaks of both the synchronous 
and the asynchronous 80C5I at 3.3 V. where the 
asynchronous version is running in quasi synchronous 
mode giving a performance that is 2.5 times higher 

K C ^ y " ChrOn0US ° nC (,HC s y«*«>nous one runs 
at iu MHz and the asynchronous one at 25 MHz) 
Despite the fact that the figure does not give a fair 
imprcss.on of the average power being consumed 
it clearly shows diat tlie cunent peaks of the asyn- 
chronous 80C5 1 are about five limes smaller than those 
of the synchronous one. 

• The performance adaptation property of asynchronous 
circuits is demonstrated in fig. 4. which shows the free- 
running performance of the micro controller when ex- 
ecutmg code from ROM. as a function of the supply 
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voltage. As could be expected, the performance de- 
pends linearly on the supply voltage. When the sup- 
ply voltage goes up from 1.5 to 3.3 V, the perfor- 
mance increases from 3 to 8.7 MIPS (about a factor 3). 
Since the ROM containing the program does not func- 
tion properly when the supply voltage is below 1.5 V, 
wc could not measure the performance for lower val- 
ues. We observed, however, thai the DES co-processor, 
which does not need a memory, still functions cor- 
rectly at a supply voltage level as low as 0.5 V. 

The figure also shows the supply current as a function 
of the supply voltage. Note that the current increases in 
this range from 0.7 to 6 mA (about a factor 9). Since in 
CMOS circuits the current is the product of the transi- 
tion rate (performance) and the charge being switched 
per transition (both of which depend linearly on the 
supply voltage), the current increases with the square 
f the voltage. From this it follows that the power, be- 
ing the product of the current and the voltage, goes up 
with the cube of the voltage. 

From this data one can compute the third curve show- 
ing the energy needed to execute an instruction, which 
increases with the square of the supply voltage from 
0.35 to 2.25 nJ. 




*m «.i i* a i tt am xa *m *.« »» 



Figure 4, Performance adaptation of asyn 
chronous 80C51 



4.2. The prefetch unit 

Fig. 5 gives the Tan gram code of a simplified version of 
the prefetch unit The prefetch unit communicates with the 
80C51 core through two channels: it receives the address 
from which to start fetching code bytes via channel StartAd- 
dress and it subsequently sends these bytes through channel 



CodcByte. Since the prefetch unit plays in both communi- 
cations the passive role, it can probe each channel to see 
whether the core has started a communication through that 
channel. The state of the prefetch unit consists of program 
counter pc, and a two-place buffer, which is implemented 
by means of an array Buffer \ an integer count, and two one- 
bit pointers getptr and putptr. 

forever 

do sel probe (Start Address) 

then Start Addressee || putptr := getptr || 
count :— 0 || AbortMemAccQ 
or probe(CodeByte) A (count > 0) 
then CodeByte\Buffer[getptr]; 
( getptr := next(getptr) 
|| count := count — 1 
) 

or MemAck 

then Buffer \putptr] := MemData; 
( putptr next(putptr) 
|| count := count + 1 
|| pc := pc + 1 || CompleteMemAccQ 
) 

les; 

if (count < 2) A (^MemReq) 
then MemReq := true 
fi 

od 

Figure 5. Tangram code of simplified version 
of prefetch unit 

The prefetch unit executes an infinite loop and in each 
step it first executes a selection command, in which it can 
select among three so-called guarded commands. Each 
guarded command is of the form "guard then comniand\ 
the guarded commands are separated by the key word "or" 
and the list is enclosed by the bracket pair "set" and *1es". 
A command is said to be enabled if the corresponding guard 
holds. Executing a selection command implies waiting un- 
til at least one of the commands is enabled, then select- 
ing, in an arbitrated choice, such a command and executing 
it. In the first guarded command, channel Start Address is 
probed to find out whether the core is sending a new start 
address. In that case, pc is sel to the address received, the 
butter is flushed and a possible outstanding memory ac- 
cess is aborted (by resetting both MemReq and the delay 
counter). All four subcommands arc executed concurrently 
( '\4||£T" means execute A and B concurrently, whereas 
*\4; means execute A and B sequentially). The second 
guarded command takes care of sending the next program 
byte via channel Code Byte to the core if the core is ready 
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to receive that byte and the buffer is not empty. The third 
guarded command gets enabled if MemAck goes high indi- 
cating that the data signals in a read access arc valid. In that 
case the value read from memory is put in the buffer after 
which the memory handshake is completed. 

After each event a next memory access is started in 
case the buffer is not full while no memory access is be- 
ing performed (since (counr < 2) V (^MemReq) holds 
the last command in the loop step can be simplified to 
M emReq := (count < 2)). 

Note that the value (pc - count) is equal to the program 
counter in the core, since it is set to the destination address 
in case of a jump, increased by I if a code byte is transferred 
Co the core, and kept invariant if a code byte is read from 
memory. Therefore the core does not need to hold the pro- 
gram counter and instead, when the information is needed 
for a relative branch, it can retrieve the counter value from 
the prefetch unit. Clearly, this feature requires an extension 
or the Tangram code shown in fig. 5. 

4.3. The DES co-processor 




Figure 6. DES co-processor architecture 

A transaction may need up to ten single DOS conver- 
sions, where each conversion takes about 10 msec if it is 
executed in software. Therefore a hardware solution is 
needed, since these conversions would otherwise consume 
about half of the transaction time. 

Fig. 6 shows the data path of the DCS co-processor The 
processor supports both single and triple-DES conversions 
and, for the latter type of conversion, contains two keys- a 
foreground and a background key. Single-DES conversions 
use the foreground key, whereas triple-DES conversions use 
the foreground key for the first and third conversion and the 



background key for the second one. The foreground key is 
stored in register CDO consisting of 56 flipflops (the DES 
key size is 56 bits), whereas the background key resides 
in variable CD! consisting of 56 latches. The text value 
resides in variable LR consisting of 64 latches (DES words 
contain 64 bits). 

A singlc-DES conversion consists of 16 steps and in 
each step, both the key is permuted and a new text value 
is computed from both the old one and the key. In order to 
have the key return to its original value at the end of a con- 
version, the key makes two basic permutations at 12 steps 
and only one at the remaining 4, where 28 basic permuta- 
uons are needed for a complete cycle. The permutations are 
performed in flipflop register CDO. 

,^f > 5? Carea is taken b * lhe combinational circuit 
called DES. Since this circuit is also dominant in power dis- 
sipation, one should minimize the number of transitions at 
its inputs. For this purpose, we have introduced two latch 
registers: cd for the key and Ir for the text. If two basic 
permutations are done in one.step, cd hides the effect of the 
first one from combinational circuit DES. In addition all in- 
puts of combinational circuit DES change only once in each 
step by loading the two registers It and cd simultaneously 
and then storing the result in register LR as described by 
the following piece of Tangram text. 

(/r := LR || cd := CDO); LR := DES{lr,cd). 
Therefore, latch register It also serves as a kind of slave 
register. Latch register cd also serves a functional purpose 
since the two keys are swapped by executing the following 
three transfers: 

cd := CDO; CDO := CDl ; CDl := cd. 

The size of the DES co-processor is 3,250 gate equiva- 
lents, of which 57% is taken by the combinational logic and 
35% by latches and flipflops. Consequently, the overhead in 
area due to the asynchronous design style (delay matching 
and C-elements) is marginal at 8%. At 3.3 V, a single-DES 
conversion takes 1 .25 /*s and 1 2 n J. 

Fig. 7 shows the simulated current of the DES co- 
processor at 3.3 V (the micro-controller is active before and 
after the DES computation). The real current peaks will be 
much smaller due to a lower supply voltage (the DES pro- 
cessor functions properly at a supply voltage as low as 0 5 
V) as well as the buffer capacitor (the resolution in the sim- 
ulation is I ns). 

The conversion lime, of a few microseconds is so 
small that we used the handshaking mechanism to' obtain 
the synchronization between the micro-controller and the 
co-processor. After starting the co-processor, the micro- 
controller can continue executing instructions and only 
when reading the result, will it be held up in a handshake 
until the result is available. Note that a synchronous design 
would require a form of busy wailing. 
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Figure 7. Current DES co-processor at 3.3 V 
4.4. Results 

Fig. 8 shows the layout of the chip, which is in a five- 
layer metal, 0.35 /im technology and has a sire of 4.52 x 
4.16 % 18 mm 2 , including the bonding pads. Many bond- 
ing pads arc only included for measurement and evaluation 
purposes. A production chip only needs about 10 bonding 
pads. 




Figure 8. Layout of smart card chip 

The two horizontal blocks on top form the buffer capac- 
itor (in a production chip, such a capacitor would only re- 
quire about one quarter of the area size). Tlie memories are 
on the next row, from left to right: two RAMs. one ROM 
and the EEPROM, which is the large block to the right. The 



asynchronous circuit is located in the lower left quadrant, 
near the center. 

Table 2 gives the area f the blocks constituting the con- 
tactless digital circuit, which is the asynchronous circuit 
together with the memories. The other modules are ei- 
ther synchronous or analog circuits, where tlie synchronous 
modules arc not used in contact less operation. From this 
table it follows that tlie asynchronous logic takes only 12% 
of the total contactless digital circuit. 



Block 


Area [mm J ] 


RAM 


1.2 


ROM 


1.0 


EEPROM 


5.6 


Async. circ. 


1.1 


Total 


8.9 



Table 2. Area contactless digital circuit blocks 



The sizes of the different asynchronous modules are 
given in Table 3. In the standard cell library used, a gate 
equivalent (GE) is 54 /xm 2 with a typical layout density of 
17,500 gates per mm 2 . 



Module 


Area [GE] 


CPU 


7,800 


Pref. Unit 


700 


DES 


3,250 


UART 


2,040 


Interfaces 


3,680 


Timer 


1,080 


Total 


18,550 



Table 3. Area asynchronous modules 

Table 4 shows the power dissipation of the digital cir- 
cuit blocks when the controller is executing code from ROM 
(being the normal situation). 



Block 


Power ( %1 


Core 


56 


ROM 


27 


RAM 


17 



Table 4. Power of contactless digital circuit 

Table 5 shows the effects on power and area of an asyn- 
chronous design at two different levels. Tlie asynchronous 
circuit proper gives a reduction in power dissipation of 
about 70% for 18% additional area. At the level of the 
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coniactless digital circuit, however, we obtain a power re- 
duction of 60% for only 2% additional area. Note that this 
analysts does not include the synchronous RSA convener 
and the analog circuits needed in a production chip, such as 
for instance the buffer capacitor and the power supply unit 
Therefore at chip level the relative reduction in power dissi- 
pation will be about the same, whereas the overhead in area 
will be reduced even further. 



Level 1 


Power [%] 


Area [%] 


Async. circ. 
Async. + Mem. 


-70 
-60 


+18 
+2 



Table 5. Effect of asynchronous design on 
power and area on different levels 



5. The power supply unit 

Fig. 9 shows the power supply unit consisting of a recti- 
fier and a power regulator, which are both completely ana- 
log circuits. The design of the rectifier is conventional, and 
f the regulator we discuss only those aspects of the be- 
haviour that are relevant to the design of the digital circuit 
without going into the details of its design 



J?eot 


±0 


Pow 


±1 


vO 






vl 







Figure 9. Power Supply Unit 

To avoid interference wiih che communication, a power 
regulator has been designed that shows an almost constant 
load at us input. Fig. 10 shows Spice-level simulation re- 
sults of such a power regulator when the input voltage V 0 
•s fixed at 5V. On the horizontal axis we have the activ 
uy (number of transitions per second) of the digital circuit 
ITic input load is almost constant, since input current i 0 i s 
almost constant over the whole range. 

When the activity is low. output voltage V, is constant 
at about 3V. In this range, too much power is coming in 
and the regulator functions as a voltage source with output 
current t, increasing when the activity increases Tlie su- 
perfluous power is shunted to ground. However. », reaches 
a saturation point when .1 rcacl.es i 0 . From this point on. 



no more power is shunted to ground and the regulator stark 
to runction as a current source with output voltage V, de 
creasmg when the activity increases. The regulator delivers 
maximum power in the middle of the range where both the 
outgoing voltage and the outgoing current are high. Note 
that these simulation results assume constant incoming RF- 
powcr. The variations in the incoming RF-powcr during a 
transaction, however, arc an additional source for fluctua- 
tions .n V t , since these variations result in shifts of the sat- 
uration point. 
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Figure 10. Power regulator behaviour 

A power source with these characteristics burdens the 
designer of a synchronous circuit with the problem of trad- 
ing off between performance and robustness. Going for 
maximum performance means assuming a supply voltage of 
3 V , n which case a transaction must be aborted if the volt- 
age drops below 2.5 V, for instance. On the other hand, if he 
opts for a more robust design by choosing 2 V as the operat- 
ing voltage, performance is lost when the regulator delivers 
3 V. Such trade offs are not needed for an asynchronous 
circuit, since it always automatically gives the maximum 
performance for the power received. 

6. Conclusions 

We have designed, built and evaluated an asynchronous 
chip for contacllcss smart cards in which we have exploited 
the fact that asynchronous circuits: 

• use little average power. 

• show small current peaks, and 

• operate over a wide range of the supply voltage. 

Measurements and simulations showed the following ad- 
vantages of this design when compared to a conventional 
synchronous one. 
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• The asynchronous circuit gives the maximum perfor- 
mance for the power received. This comes mainly 
from the fact that the asynchronous design needs less 
of what is the main limiting factor for the performance, 
namely power. Compared to a synchronous design, the 
asynchronous circuit needs about 40% of the power 
for less than 2% additional area. In addition, the auto- 
matic speed adaptation property of asynchronous cir- 
cuits saves the designer from trading off between per- 
formance and robustness. Due to this property the 
asynchronous circuit will give free-running instead of 
guaranteed performance, where the difference between 
the two is about a factor two. 

• The asynchronous design is more resilient to voltage 
drops, since it still operates correctly for voltages down 
to 1.5 V. 

• The current peaks of an asynchronous circuit arc less 
pronounced, making the requirements with respect to 
the buffer capacitor more modest. 

• The combination of the power regulator with the asyn- 
chronous circuit gives little communication interfer- 
ence. In this case, the smaller current peaks and the 
self-adaptation property are of importance. 

These advantages were so convincing that, based on the 
circuits presented, a product (chip for dual interface card) is 
being designed. 

Acknowledgements 

We gratefully ackowledge the other members of the 
Tan gram team: Kees van Berkel, Marc Verra and Erwin 
Woutersen, and we thank Klaus Ully for helping us to get 
the DES converter right. This work was funded by the Eu- 
ropean Commission under Esprit TCS/ESD-LPD contract 
255 19 (Design Experiment DESCALE). 

References 

(I I E. Brunvand and R. Sproull. Translating concurrent pro- 
grams into delay-insensitive circuits. In IEEE int. Conf. on 
Computer Aided Design, pages 262-265, 1989. 

[2J C A. R. Hoarc. Communicating Sequential Processes. 
Prentice-Hall International. Series in Computer Science 
1985. 

[3J A. J. Martin. Compiling communicating processes into 

delay-insensitive VLSI circuits. Distributed Computing, 

l<4):226-234. 1986. 
[4} Identification cards - com act less integrated circuits) cards - 

proximity cards. Standard ISO/ICC 1444. 
f5| National Bureau of Standards. Data encryption standard. 

Jan. 1977. Federal Information Processing Standards Puhli- 

cation 46. 



[6 1 A. Pccters. Single-Rait Handshake Circuits. PhD thesis, 
Eindhoven University of Technology. 1 996. 

(7) R. Rivcst. A. Shamir, and L. Adleman. A method for obtain- 
ing digital signatures and public-key cryptosystems. Comm. 
of the ACM,2\: 120-126, June 1978. 

van Berkel. Handshake circuits: un asynchronous archi- 
tecture, for VLSI programming. Cambridge University Press. 
International Series on Parallel Computation. 1993. 

(9J K. van Berkel. R. Burgess, J. Kcssels. A. Peeters, M. Ron- 
cken, and F. Schalij. Asynchronous circuits for low power: 
a DCC error corrector. IEEE Design and Test. 1 1 ( 2): 22-32 
June 1994. 

[10J K. van Berkel. J. Kessels, M. Roncken, R. Saeijs. and 
F. Schalij. The VLSI-programming language Tangram and 
its translation into handshake circuits. In Proc. Eur. Conf. on 
Design Automation, pages 384-389, Amsterdam, 1991. 

[II] H. van Gagcldonk, K. van Berkel. A. Pccters, D. Baumann. 
D. Gloor. and G. Stegman. An asynchronous low-power 
80c5l microcontroller. In Proc. Int. Symp. Advanced Re- 
search in Asynchronous Circuits and Systems, pages 96- 
107, San Diego, 1998. 



44 



X:iO <XP 1037733SA I > 



-4 — . »- 



